Voice Activity Detection for Ultrasound-Based Silent Speech Interfaces Using Convolutional Neural Networks
نویسندگان
چکیده
Voice Activity Detection (VAD) is not easy task when the input audio signal noisy, and it even more complicated an recording. This case with Silent Speech Interfaces (SSI) where we record movement of articulatory organs during speech, aim to reconstruct speech from this Our SSI system synthesizes ultrasonic videos tongue movement, quality resulting signals are evaluated by metrics such as mean squared error loss function underlying neural network Mel-Cepstral Distortion (MCD) reconstructed compared original. Here, first demonstrate that amount silence in training data can have influence both on MCD evaluation metric performance model. Then, train a convolutional classifier separate silent speech-containing ultrasound images, using conventional VAD algorithm create labels corresponding signal. In experiments our ultrasound-based speech/silence separator achieved classification accuracy about 85\% AUC score around 86\%.
منابع مشابه
Low frequency ultrasonic voice activity detection using convolutional neural networks
Low frequency ultrasonic mouth state detection uses reflected audio chirps from the face in the region of the mouth to determine lip state, whether open, closed or partially open. The chirps are located in a frequency range just above the threshold of human hearing and are thus both inaudible as well as unaffected by interfering speech, yet can be produced and sensed using inexpensive equipment...
متن کاملVoice Conversion using Convolutional Neural Networks
The human auditory system is able to distinguish the vocal source of thousands of speakers, yet not much is known about what features the auditory system uses to do this. Fourier Transforms are capable of capturing the pitch and harmonic structure of the speaker but this alone proves insufficient at identifying speakers uniquely. The remaining structure, often referred to as timbre, is critical...
متن کاملVessel Detection in Ultrasound Images Using Deep Convolutional Neural Networks
Deep convolutional neural networks have achieved great results on image classification problems. In this paper, a new method using a deep convolutional neural network for detecting blood vessels in Bmode ultrasound images is presented. Automatic blood vessel detection may be useful in medical applications such as deep venous thrombosis detection, anesthesia guidance and catheter placement. The ...
متن کاملConvolutional networks for speech detection
In this paper, we introduce a new framework for speech detection using convolutional networks. We propose a network architecture that can incorporate long and short-term temporal and spectral correlations of speech in the detection process. The proposed design is able to address many shortcomings of existing speech detectors in a unified new framework: First, it improves the robustness of the s...
متن کاملDetection of schizophrenia patients using convolutional neural networks from brain effective connectivity maps of electroencephalogram signals
Background: Schizophrenia is a mental disorder that severely affects the perception and relations of individuals. Nowadays, this disease is diagnosed by psychiatrists based on psychiatric tests, which is highly dependent on their experience and knowledge. This study aimed to design a fully automated framework for the diagnosis of schizophrenia from electroencephalogram signals using advanced de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-83527-9_43